Folding of a fractal globule — the key for artificial molecular machines design 
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Folding and unfolding of a fractal (crumpled) polymer globule, which is recently proved to be 
a condensed structure of DNA in human chromosomes, is described as a cascade of equilibrium 
phase transitions in a hierarchical system. The fractal globule is characterized by a set of order 
parameters, encoding the crumples of different hierarchical levels. These order parameters have 
natural representation in terms of coordinates in the "ultrametric space" . Unfolding manifests itself 
in a sequential loss of stability of hierarchical levels with the temperature change, and is described by 
a master equation for a stochastic process with the jumps given by the Parisi-type matrix. Studying 
the relaxation properties of the elastic network of contacts in a fractal globule, we show that the 
dynamic properties of hierarchically folded polymer chains in globular phase are similar to those of 
molecular machines. 

PACS numbers: 61.41.+e, 64.60. al, 64.60. aq, 87.15. Cc 



A notion of molecular machine (MM) is usually at- 
tributed to a nanoscale structure able to convert per- 
turbations of fast degrees of freedom into a slow motion 
along a specific path on a low-dimensional manifold - 
see Molecular machines possess various functions: 

they create or break chemical bonds, move mesoscopic 
objects, etc. Rich diversity of MMs is produced in a liv- 
ing world. Globular proteins formed by polypeptides of 
hundreds units look as simplest examples of MM. The 
ability of proteins to manipulate by solitary charges, or 
atoms against fluctuations, is the main requirement for 
reproduction of living systems at a molecular level [1, Q . 

Is it possible to design molecular machines artificially? 
This challenging question addresses to the prospects of 
artificial algorithmic chemistry. In particular, it concerns 
the problem of overcoming the "complexity threshold" in 
prebiology, meaning that even a "primary" cell with min- 
imal machinery of biological replication remains hope- 
lessly complex for its own appearance beyond the world 
of MMs (see, for example, (g). So, the question whether 
nonspecific polymers can spontaneously fold into MMs is 
still open. In this paper we propose the constructive way 
of creation one family of such structures. 

In the definition of MMs we follow the work Q , where a 
MM is specified by a particular relaxation dynamics of its 
elastic network. Let a molecular structure be represented 
by a spatial distribution of a set of nodes i ~ 1, N and 
undirected links between them, encoded in an adjacency 
matrix A. The elements of A are: a.y = 1 for a link 
between i and j, and = otherwise. In elastic net- 
works, linked nodes are subjected the action of elastic 
forces that obey the Hooke's law. In the over-damped 
limit, the velocity of each node is proportional to the sum 
of clastic forces applied to it, so the relaxation is: 
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where = Ri it) is the current position of the i-th node 

and R,[°^ is its equilibrium position and = jH~r^7[- 

For small deviations from equilibrium, jr^l = |Ri — R-i |j 
nonlinear equations ([T]) can be approximated by linear 
ones 
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where the strain tensors, A^j, are the building blocks of 
the 3A^ X 3A^ linearized matrix A. The relaxation of a 
system under small perturbations is a sum of independent 
normal modes, Yi{t) = ^^^j^ ri(0) cxp (— A^i), where 
Afc > and arc the eigenvalues and eigenvectors of the 
matrix A. The smallest nonzero eigenvalues describing 
the relaxation of the system along the slowest degrees of 
freedom, are crucial for the very definition of MMs. 

The distinguished features of MMs became transparent 
from the study of protein elastic network relaxation 0]. 
Being perturbed, a protein molecule quickly reaches a 
low-dimensional attracting manifold spanned by slowest 
degrees of freedom, and then slowly relaxes to the equi- 
librium along a particular path in this manifold. The ex- 
istence of a low-dimensional manifold with a large basin 
of attraction and wide spectral gap separating slowest 
(soft) and fast (rigid) modes characterize the MMs. 

The polymer globules with relaxation similar to MMs 
are the targets for our search. The globules obtained in 
the ordinary coil-to-globule phase transition have almost 
random clastic networks with many local metastable 
states and small basins of attraction. The soft and rigid 
relaxation modes of random networks are not well sepa- 
rated 3 which means that the randomly folded equilib- 
rium polymer globules do not possess characteristics of 
MMs. 

The search for candidates for a prototype of an artifi- 
cial MM is based on the theoretical analysis of a protein 
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dynamics established by a variety of experimental data 
on CO-myoglobin binding M], and on spectral diffusion 
in deeply frozen proteins These data point to the 
fact that protein fluctuational dynamics is similar to the 
ultrametric random walk in a wide temperature range 
from 4"K up to 300°K [iOl- This result definitely signi- 
fies that proteins may have self-similar hierarchical order 
hidden in energy landscapes and conformational states 
(see also [T2|V 

A hierarchical organization, being translated to a net- 
work of contacts, means that a corresponding adjacency 
matrix has a block-hierarchical form similar to the Parisi 
matrix [Tsl . [13. Topolo gical features of random block- 
hierarchical networks and their relevance to hierarchical 
systems of different nature, have been discussed in 

[13- 

[Toj . Recent work (isj has provided experimental evidence 
of the hierarchical structural organization of a DNA chain 
in a human genome in a form of a fractal globule (FG), 
which was predicted theoretically in [l^ as a collapsed 
state of unknotted ring polymer. The apparent simplic- 
ity of a FG creation (the abrupt change of the solvent 
quality provokes a collapse transition in a long unentan- 
gled polymer ring which initially is in a coil state), has 
led to regard FG as a strong candidate for a molecular 
machine. Understanding the functioning of such macro- 
molecular structures is impossible without developing an 
adequate language for the description of the process of 
creation and destruction of crumples on different hierar- 
chical levels. Here we make first modest steps towards 
the construction of the equilibrium statistical theory of 
FG formation and melting. 

The deep relation between phase transitions in a sys- 
tem of Ising spins with block-hierarchical interactions 
[20I [2l| and critical phenomena in block-hierarchical net- 
works, discussed in [l9[, allows us to suggest the new way 
of analytic description of folding and unfolding of the 
fractal globule. In the hierarchical Ising system, when 
temperature decreases, a hierarchy of phase transitions 
occurs producing a tree of hierarchically embedded sets of 
correlated spins. The theory of FG formation is based on 
the same ideas. It should be emphasized that the consid- 
eration presented below is not a theory of coil-to-fractal 
globule phase transition, but rather the description of a 
FG formation and melting, since we are interested in for- 
mation and destruction of hierarchical set of folds only, 
which happens within the globular phase of a polymer. 

Consider a polymer chain in the D = 3-dimensional 
space. The non-phantomness leads to two types of in- 
teractions: bulk interactions vanishing in infinitely thin 
chains, and topological interactions, preserved even for 
zero's thickness chains. At T > 6*, a polymer of N seg- 
ments, each of length ^, is in a coil phase. While at 
T < 9, a macromolecule collapses into a weekly fluctuat- 
ing drop-like globule of size Rgi - iN^/^ (HHI). In 
the globular phase formed by a linear chain with open 
ends, all subchains of length £s, where s < N"^/^ , look as 
Gaussian coils due to the De Gennes theorem of screening 



of volume interactions in the melt. 

However, for linear unknotted polymer rings the situa- 
tion is quite different. The topological constrains play the 
role of auxiliary repulsion and &tT < 9 there exists a cer- 
tain critical length, ^sq, (depending on temperature and 
bulk interaction), such that relatively small chain parts 
of the length Isq collapse. Define ^so^link segments as 
the block monomers, constituting the Ist-level crumples. 
The chain segments containing several consecutive 1st- 
Icvel crumples collapse "in its own volume" , forming the 
2nd-level crumples, they then collapse in the 3rd-level 
crumples, and so on. The hierarchical chain packing is 
completed when all £so-link segments belong to a single 
crumple of a highest level. Such a packing means that 
a chain path is space-filling with the fractal dimension 

In a hierarchically folded macromolecule each sp-link 
unit can be characterized by a set of indices specifying to 
which particular Ist-level fold (embedded into particular 
2nd-lcvel fold, that, in turn, is embedded into particular 
3rd-levcl fold, etc) this unit belongs. Such indexing is 
coded by a path on a Cayley tree - see the Fig[TJ The 
boundary nodes (leaves) of this tree constitute a space 
of states for the units, and each subtree corresponds to a 
particular crumple. In case of regularly branching Cayley 
tree with the branching index p + 1 (p = 2 in the figure) 
and 7 = 1, 2, enumerating the levels of hierarchy, each 
7-level crumple consist of p"*" units. Hence, assuming that 
any two states are distinguished only at the scale ^ p'' 
of a maximal fold separating them, we end up with the 
ultrametric space of states. In such a description, the 
conformations of hierarchically folded chain are in bijec- 
tion with the trajectories of a walker in an ultrametric 
space, which is the boundary of the tree. Thus, the chain 
conformation (hierarchy of folds) is fully characterized by 
the leaves (black dots) in the FigU^. 
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Figure 1: (a) Space of states in a fractal globule. Each chain 
monomer is characterized by its position on the boundary 
of the ultrametric space (black dots); (b) Block-hierarchical 
structure of transition matrix (p = 2). 

Let fi{s) be the probability to find a walker in the state 
i at the boundary of the tree after s steps (0 < s < N), if 
at s = it was at iq. Assuming the random walk to be a 
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homogeneous Markov chain, we have a master equation 

OMs) 



ds 



Y.wmf,{s)-[Y,wm]Ms) (3) 



Si^ig, where 



subject to the initial condition fi{s = 0) 
■"^(^Ij) — ''^(j I*) is the probabihty to jump between states 
i and j in one step. The uhrametric random walk is spec- 
ified by the Parisi-like transition matrix W of a block- 
hierarchical form, shown in the FigIT}) and obeying the 
strong triangle inequality (see, for example, [l!])- For a 
random walk at the boundary of a regular p-branching 
Cayley tree, each block of the matrix W of the hierarchi- 
cal level 7 consists ofp'^ equal elements, that depend on 7 
only. For example, w(3|4) = wi, u'(3|7) = W3, u'(5|8) = 
W2, etc - compare FigHJleft) and FiglTJright). The en- 
ergy landscape of the ultrametric space is specified by a 
choice of transitions w-y on the hierarchical level, 7. The 
landscape is supposed to be linear i.e. = p^("+i)T, 
where a > is some parameter. 

The conformations of the hierarchically folded chain 
of length L = £N are parameterized by the iV-step tra- 
jectories on the boundary of the Cayley tree. The tran- 
sition probability fi{s = N) should be understood as a 
measure of all conformations of the hierarchically folded 
s-step chain, whose first unit is in an initial state io and 
the last one, s = iV, to be in the state i. The function 
fi{s) satisfies the Kolmogorov-Feller equation ([3]). It is 
known ([1^) that the symmetric transition operator in 
the right hand side of the equation 1^ is diagonalizible by 
the p-adic Fourier transformation, and for unbounded ul- 
trametric space its non-zero eigenvalues are ~ —p 



Since all eigenvalues are negative, the hierarchically 
folded conformations given by ^ have a well defined 
ground state with a specific set of eigenvalues, one for 
each fold of a given scale. The chain conformations are 
subjected to the action of a hierarchy of strong con- 
strains, and the temperature is supposed to be much 
lower than the critical temperature of the coil-globule 
phase transition. 

The unfolding at some scale 7cr means the destruc- 
tion of hierarchical tree-like set of states above jcr- 
This can be easily achieved by introducing an auxil- 
iary temperature-dependent repulsion between the units. 
The truncation of these interactions depends on the scale 
of crumples. To satisfy this requirement, the interaction 
potential should be again of a block-hierarchical form 
U with elements depending on the ultrametric distances 
between the units only. Thus, we can generalize ([3]) to: 



^f^is) 

ds 



^ wmf.is) - [Y^^m] M-^) + rvfd 
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where r = ^ and U/j(s) = w{i\j)fj{s) 

Again, using the p-adic Fourier technique one can di- 
agonalize the transition operator in the right hand side 



of (|4]). The nonzero eigenvalues are: 



(5) 



The solutions of the equations A-y(r) = define a se- 
quence of critical temperatures Ti < r2 < ... < Tmax 
at which the hierarchically folded conformations lose the 
stability. At Ti the largest fold, i.e. the highest level 
Imax becomes unstable and the fractal globule unfolds 
into a chain with smaller crumples. At T2 > Ti, these 
crumples of level 7 = jmax ^ 1 unfold, etc, until T^ax is 
reached, at which the crumple 7 = 1 becomes unstable 
and the chain unfolds completely. If the temperature is 
decreasing, the hierarchical folding of a polymer chain 
is characterized by a sequence of critical temperatures, 
Tmax > ... > T2 > Ti, at which the sequence of folds of 
hierarchical levels 7 = 1,2,..., jmax is formed. 

We have used the Monte-Carlo method in the 3D con- 
tinuous space [13 to simulate hierarchical folding of a 
polymer chain. A polymer of 202 monomers, among 
which we have chosen 32 specific units, is simulated in 
a large enough box without periodical boundary condi- 
tions. The chain is described by a "beads-on-the springs" 
model with a standard set of interaction potentials (see 
[2?! for details), accounting for rigidity and volume in- 
teractions. In addition, a hierarchical potential Uy oper- 
ating between the specific units is introduced. It ensures 
that at each hierarchical level 7, the formation of clusters 
of specific units becomes possible only if two clusters of 
the previous level 7 — 1 are close to each other in space, 
and vice versa: a cluster of a level 7 — 1 cannot disap- 
pear if it is a part of an existing cluster of the level 7. A 
spatial distance between two clusters of the level 7 is de- 
termined as the smallest distance between nodes of these 
clusters. We have used a short-range square-well poten- 
tial with the depth of the well decreasing upon increasing 
the hierarchical level 7, but we have checked that the par- 
ticular choice of this hierarchical potential is not crucial. 
However, the rules of hierarchical formation of clusters 
are very important. These rules imitate the hierarchical 
constrains imposed on the chain conformations under the 
polymer crumpling. The chain conformations are evolved 
via Metropolis algorithm. Equilibrating them, we con- 
struct an elastic network: specific units become the nodes 
of this network connected by new springs if the distances 
between them are less than three diameters of a unit; all 
other units are erased. A typical conformation of hierar- 
chically collapsed polymer chain and its elastic network 
are shown in Figl2^ and Figl^Jj respectively. The bench- 
marks A, B and C are used to visualize the relaxation 
trajectories of the elastic network (see Q for details) . 

To characterize our system we have introduced the 
hierarchy of order parameters, = z^^'^l-y) ^'^ ~ 
1,2, ...,^max)i where Z{'-f) is the number of actually 
formed clusters on a level 7 averaged over a set of hier- 
archically folded structures, and Zmax{j) is the maximal 
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Figure 2: Left: Typical shape of hierarchically folded globule. 
Representative nodes are shown in red, other monomers are 
green; Right: The elastic network; blue, green and yellow 
colors mark the nodes constituting the 3D coordinate system 
for the visualization of the network dynamics. 

number of possible clusters on the level 7, < g-^ < 1 
for any 7. In the Fig|3ljleft) the simulations for the scries 
of order parameters 51,..., 55 as the function of the in- 
verse temperature, /3, are shown. By increasing /? we see 
the sequence of transitions: first, the smallest crumple 
is formed (black squares), then the 2nd crumple appears 
(red circles), etc. The typical eigenvalue spectra of elas- 
tic networks of hierarchically collapsed chains is shown 
in the FiglSUright). Note a large spectral gap separating 
the slowest mode from the rest of spectrum (A2/A1 ~ 7, 
A2/A3 « 1). 




P 



Figure 3: Left: Numerical observation of the hierarchy of 
transitions under the fractal globule formation: {3 ~ {T~^ — 
9~^); Right: Eigenvalue spectra of the network of hierarchi- 
cally folded chain and ordinary globule normalized to the low- 
est nonzero eigenvalue. 

In the FigH] a 3D view of relaxation trajectories of 
the hierarchically folded elastic network ([T]) is shown. 
The relaxation dynamics is such that the hierarchically 
folded globule quickly relaxes to a low-dimensional (one- 
dimensional, in our demonstration) manifold with a large 
attracting basin and then slowly moves along it to the 
equilibrium. Such a behavior is very typical for molecu- 
lar machines. It should be pointed out that the elastic 
network of the fractal globule is highly anisotropic: along 



the 5u23-axis it is 10^ more rigid that along Sui2- and 
iJitis-axes. In the ordinary globule such anisotropy is 
absent and its elastic network does not have any low- 
dimensional attracting manifold for phase trajectories. 
To represent the slowest degrees of freedom, we chose 
in the network a triangle ABC (marked by blue, green, 
and yellow nodes in the Figl2]) such that the changes 

of the distances along the sides AB ((5ui2/mi2'') and BC 
((5ui3/m^3^) correspond to the slowest relaxation modes. 
The particular selection of benchmarks A, B and C is 
as follows. Computing the spectrum of elasticity matrix, 
we extract the eigenvectors, ei and 62, corresponding to 
slowest relaxation modes. Then the points A, B and C 
are chosen such that the projections of the average change 
of the distances AB onto ei and BC onto 62 are maximal. 




Figure 4: A set of relaxation trajectories of a hierarchically 
folded globule. Insert: Projection of relaxation trajectories 
on the plane 5u23/uf^ (fast mode) vs Sui2/u^i2 i^^^ slowest 
mode). 

The structure considered here is typical for a wide 
family of hierarchical globules. The diversity concerns 
mainly the number of principal soft modes, manifested in 
the dimensionality of the attracting manifold. Typically, 
these manifolds are two- or three-dimensional. This fact 
might be important to get functional variability without 
altering the structural archetype. The considered in this 
work sample of FG does not realize yet any specific func- 
tion except the conversion of thermal random perturba- 
tions of fast degrees of freedom into slow motion along a 
well-defined path. This capability allows, in principle, to 
manipulate by single atoms, or molecules in a controlled 
manner. The hierarchically folded domain might play 
in a molecular machine the same constructive role as an 
engine in a mechanical device. 
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